Direct and indirect evidence of compression of word lengths. Zipf’s law of abbreviation revisited

نویسندگان

چکیده

Zipf’s law of abbreviation, the tendency more frequent words to be shorter, is one most solid candidates for a linguistic universal, in sense that it has potential being exceptionless or with number exceptions vanishingly small compared languages on Earth. Since pioneering research, this been viewed as manifestation universal principle communication, i.e. minimization word lengths, reduce effort communication. Here we revisit concordance written language abbreviation. Crucially, provide wider evidence holds also speech (when length measured time), particular 46 from 14 families. Agreement abbreviation provides indirect compression via theoretical argument prediction optimal coding. Motivated by need direct compression, derive simple formula random baseline indicating lengths are systematically below chance, across families and writing systems, independently unit measurement (length characters duration time). Our work paves way measure compare degree optimality languages.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

assessment of deep word knowledge in elementary and advanced iranian efl learners: a comparison of selective and productive wat tasks

testing plays a vital role in any language teaching program. it allows teachers and stakeholders, including program administrators, parents, admissions officers and prospective employers to be assured that the learners are progressing according to an accepted standard (douglas, 2010). the problems currently facing language testers have both practical and theoretical implications but the first i...

on direct sums of baer modules

the notion of baer modules was defined recently

Compression and the origins of Zipf's law of abbreviation

Languages across the world exhibit Zipf’s law of abbreviation, namely more frequent words tend to be shorter. The generalized version of the law an inverse relationship between the frequency of a unit and its magnitude holds also for the behaviors of other species and the genetic code. The apparent universality of this pattern in human language and its ubiquity in other domains calls for a theo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Glottometrics (Lüdenscheid. Print)

سال: 2023

ISSN: ['1617-8351', '2625-8226']

DOI: https://doi.org/10.53482/2023_54_407